# Low-resource inference optimization
Kodify Nano GGUF
Apache-2.0
Kodify-Nano-GGUF is the GGUF version of the Kodify-Nano model, optimized for CPU/GPU inference. It is a lightweight large language model suitable for code development tasks.
Large Language Model
MTSAIR
161
1
Qwen3 30B A1.5B 64K High Speed NEO Imatrix MAX Gguf
An optimized variant of the Qwen3-30B-A3B Mixture of Experts model that gains speed by reducing the number of active experts. It supports a 64k context length and is suitable for a range of text generation tasks.
Large Language Model Supports Multiple Languages
DavidAU
508
3
Qwen3 128k 30B A3B NEO MAX Imatrix Gguf
Apache-2.0
A GGUF quantized version of the Qwen3-30B-A3B Mixture of Experts model with the context extended to 128k. It is quantized with the NEO Imatrix method and supports multilingual and multitask use.
Large Language Model Supports Multiple Languages
DavidAU
17.20k
10
Llama 4 Scout 17B 16E Instruct Bnb 4bit
Other
A quantized version of meta-llama/Llama-4-Scout-17B-16E-Instruct that uses int4 quantization and is suitable for multilingual tasks.
Large Language Model
Transformers Supports Multiple Languages

bnb-community
1,286
1
Llama 3.2 11B Vision Instruct GGUF
Llama-3.2-11B-Vision-Instruct is a multilingual vision-language model that takes images and text as input and generates text.
Image-to-Text
Transformers Supports Multiple Languages

pbatra
172
1
Nvidia Llama 3.1 Nemotron 70B Instruct HF AWQ INT4
An AWQ 4-bit quantized version of NVIDIA's Llama-3.1-Nemotron-70B-Instruct model, which NVIDIA customized from Meta's Llama-3.1-70B-Instruct to improve the helpfulness of generated responses.
Large Language Model
Transformers Supports Multiple Languages

ibnzterrell
206
5
Kunoichi DPO V2 7B GGUF Imatrix
A 7B-parameter large language model based on the Mistral architecture and trained with DPO (Direct Preference Optimization). It performs well on multiple benchmarks.
Large Language Model
Lewdiculous
3,705
39
Speechless Coder Ds 6.7b
Apache-2.0
speechless-coder-ds-6.7b is a large language model fine-tuned from deepseek-ai/deepseek-coder-6.7b, with a focus on code generation and programming assistance.
Large Language Model
Transformers Supports Multiple Languages

uukuguy
771
7
Genz 70b
GenZ is a large language model fine-tuned from Meta's open-source Llama 2 70B model, intended to give the open-source community a high-performance text generation model.
Large Language Model
Transformers English

budecosystem
1,556
31